<110> APPLICANT: Hope, Ralph Graham 

McLauchlan, John 
<120> TITLE OF INVENTION: VIRAL THERAPEUTICS 
<130> FILE REFERENCE: DYOU17.001CP1 
<140> CURRENT APPLICATION NUMBER: US/09/973,322 
<141> CURRENT FILING DATE: 2001-10-09 
<150> PRIOR APPLICATION NUMBER: US 09/201,916 
<151> PRIOR FILING DATE: 1998-12-01 
<150> PRIOR APPLICATION NUMBER: GB 9825951.8 
<151> PRIOR FILING DATE: 1998-11-26 
<160> NUMBER OF SEQ ID NOS: 20 

<170> SOFTWARE: FastSEQ for Windows Version 4.0 
<210> SEQ ID NO: 1 
<211> LENGTH: 630 
<212> TYPE: DNA 

<213> ORGANISM: Hepatitis C Virus 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: (43)...(630) 

<400> SEQUENCE: 1 

ggtgcttgcg agtgccccgg gaggtctcgt agaccgtgca cc atg agc acg aat 

Met Ser Thr Asn 
1 

cct aaa cct caa aga aaa acc aaa cgt aac acc aac cgt cgc cca cag 
Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gln 
5 10 15 20 

gac gtt aag ttc ccg ggt ggc ggt cag atc gtt ggt gga gtt tac ttg 
Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu 

25 30 35 

ttg ccg cgc agg ggc cct aga ttg ggt gtg cgc gcg acg agg aag act 
Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Thr 

40 45 50 

tcc gag cgg tcg caa cct cga ggt aga cgt cag cct atc ccc aag gca 
Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys Ala 

55 60 65 

cgt cgg ccc aag ggc agg aac tgg gct cag ccc ggg tat cct tgg ccc 
Arg Arg Pro Lys Gly Arg Asn Trp Ala Gln Pro Gly Tyr Pro Trp Pro 

70 75 80 

ctc tat ggc aat gag ggt tgc ggg tgg gcg gga tgg ctc ctg tcc ccc 
Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser Pro 
85 90 95 100 

agt ggc tct cgg cct agt tgg ggc ccc aac gac ccc cga cgt agg tcg 
Ser Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro Arg Arg Arg Ser 

105 110 115 

cgc aat ttg ggt aag gtc atc gat acc ctt acg tgc ggc ttc gtc gat 
Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Val Asp 
120 125 130 
68 ctc atg ggg tac ata ccg ctc gtc ggc gcc cct ctt aga ggc gct gcc 486 

69 Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Arg Gly Ala Ala 

70 135 140 145 

72 agg gcc ctg gcg cat ggc gtc cgg gtt ctg gaa gac ggt gtg aac tat 534 

73 Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr 

74 150 155 160 

76 gca aca ggt aac ctt cct ggt tgc tct ttc tct atc ttc ctt ctg gcc 582 

77 Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala 

78 165 170 175 180 

80 ctg ctc tct tgc ctg act gtg ccc gct tca gcc tac caa gtg cgc aac 630 

81 Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Gln Val Arg Asn 

82 185 190 195 

86 <210> SEQ ID NO: 2 

87 <211> LENGTH: 60 

88 <212> TYPE: DNA 

89 <213> ORGANISM: Hepatitis C Virus 

91 <220> FEATURE: 

92 <221> NAME/KEY: CDS 

93 <222> LOCATION: (1)...(60) 

94 <223> OTHER INFORMATION: Corresponds to aa 125 to 144 of SEQ ID. No. 1 

96 <400> SEQUENCE: 2 

97 acc ctt acg tgc ggc ttc gtc gat ctc atg ggg tac ata ccg ctc gtc 48 

98 Thr Leu Thr Cys Gly Phe Val Asp Leu Met Gly Tyr Ile Pro Leu Val 

99 1 5 10 15 

101 ggc gcc cct ctt 60 

102 Gly Ala Pro Leu 

103 20 

106 <210> SEQ ID NO: 3 

107 <211> LENGTH: 18 

108 <212> TYPE: DNA 

109 <213> ORGANISM: Hepatitis C Virus 

111 <220> FEATURE: 

112 <221> NAME/KEY: CDS 

113 <222> LOCATION: (1)...(18) 

114 <223> OTHER INFORMATION: Corresponds to aa 161-166 of SEQ ID. No. 1 

116 <400> SEQUENCE: 3 

117 ggt gtg aac tat gca aca 18 

118 Gly Val Asn Tyr Ala Thr 

119 1 5 

122 <210> SEQ ID NO: 4 

123 <211> LENGTH: 1900 

124 <212> TYPE: DNA 

125 <213> ORGANISM: Human 

127 <220> FEATURE: 

128 <221> NAME/KEY: misc_feature 

129 <222> LOCATION: (1)...(1900) 

130 <223> OTHER INFORMATION: n = A,T,C or G 

132 <400> SEQUENCE: 4 

133 cgtcttcggg acgcgcccgc tcttcgcctt tcgctgcagt ccgtcgattt ctttctccag 60 
134 gaagaaaaat ggcatccgtt gcagttgatc cacaaccgag tgtggtgact cgggtggtca 120 

135 acctgccctt ggtgagctcc acgtatgacc tcatgtcctc agcctatctc agtacaaagg 180 

136 accagtatcc ctacctgaag tctgtgtgtg agatgscaga gaacggtgtg aagaccatca 240 

137 cctccgtggc catgaccagt gctctgccca tcatccagaa gctagagccg caaattgcag 300 

138 ttgccgatac ctatgcctgt aaggggctag acaggattga ggagagactg cctattctga 360 

139 atcagccatc aactcagatt gttgccaatg ccaaaggcgc tgtgactggg gcaaaagatg 420 
W--> 140 ctgtgacgac tactgtgact ggggccaagg attctgtngc cagcacgatc acaggggtga 480 

141 tggacaagac caaaggggca gtgactggca gtgtggagaa gaccaagtct gtggtcagtg 540 

142 gcagcattaa cacagtcttg gggagtcgga tgatgcagct cgtgagcagt ggcgtagaaa 600 

143 atgcactcac caaatcagag ctgttggtag aacagtacct ccctctcact gaggaagaac 660 

144 tagaaaaaga agcaaaaaaa gttgaaggat ttgatctggt tcagaagcca agttattatg 720 

145 ttagactggg atccctgtct accaagcttc actcccgtgc ctaccagcag gctctcagca 780 

146 gggttaaaga agctaagcaa aaaagccaac agaccatttc tcagctccat tctactgttc 840 

147 acctgattga atttgccagg aagaatgtgt atagtgccaa tcagaaaatt caggatgctc 900 

148 aggataagct ctacctctca tgggtagagt ggaaaaggag cattggatat gatgatactg 960 

149 atgagtccca ctgtgctgag cacattgagt cacgtactct tgcaattgcc cgcaacctga 1020 

150 ctcagcagct ccagaccacg tgccacaccc tcctgtccaa catccaaggt gtaccacaga 1080 

151 acatccaaga tcaagccaag cacatggggg tgatggcagg cgacatctac tcagtgttcc 1140 

152 gcaatgctgc ctcctttaaa gaagtgtctg acagcctcct cacttctagc aaggggcagc 1200 

153 tgcagaaaat gaaggaatct ttagatgacg tgatggatta tcttgttaac aacacgcccc 1260 

154 tcaactggct ggtaggtccc ttttatcctc agctgactga gtctcagaat gctcaggacc 1320 

155 aaggtgcaga gatggacaag agcagccagg agacccagcg atctgagcat aaaactcatt 1380 

156 aaacctgccc ctatcactag tgcatgctgt ggccagacag atgacacctt ttgttatgtt 1440 

157 gaaattaact tgctaggcaa ccctaaattg ggaagcaagt agctagtata aaggccctca 1500 

158 attgtagttg tttccagctg aattaagagc tttaaagttt ctggcattag cagatgattt 1560 

159 ctgttcacct ggtaagaaaa gaatgatagg cttgtcagag cctatagcca gaactcagaa 1620 

160 aaaattcaaa tgcacttatg ttctcattct atggccattg tgttgcctct gttactgttt 1680 

161 gtattgaata aaaacatctt catgtgggct ggggtagaaa ctggtgtctg ctctggtgtg 1740 

162 atctgaaaag gcgtcttcac tgctttatct catgatgctt gcttgtaaaa cttgatttta 1800 

163 gtttttcatt tctcaaatag gaatactacc tttgaattca ataaaattca ctgcaggata 1860 
W--> 164 gaccagttna gnagcaaaca nncangtaca cnnaaganac 1900 

166 <210> SEQ ID NO: 5 

167 <211> LENGTH: 437 

168 <212> TYPE: PRT 

169 <213> ORGANISM: Human 

171 <220> FEATURE: 

172 <221> NAME/KEY: VARIANT 

173 <222> LOCATION: (1)...(437) 

174 <223> OTHER INFORMATION: Xaa = Any Amino Acid 

176 <400> SEQUENCE: 5 

177 Met Ala Ser Val Ala Val Asp Pro Gln Pro Ser Val Val Thr Arg Val 

178 1 5 10 15 

179 Val Asn Leu Pro Leu Val Ser Ser Thr Tyr Asp Leu Met Ser Ser Ala 

180 20 25 30 

181 Tyr Leu Ser Thr Lys Asp Gln Tyr Pro Tyr Leu Lys Ser Val Cys Glu 

182 35 40 45 

W--> 183 Met Xaa Glu Asn Gly Val Lys Thr Ile Thr Ser Val Ala Met Thr Ser 

184 50 55 60 

185 Ala Leu Pro Ile Ile Gln Lys Leu Glu Pro Gln Ile Ala Val Ala Asp 
186 65 70 75 

187 Thr Tyr Ala Cys Lys Gly Leu Asp Arg Ile Glu Glu Arg Leu Pro Ile 

188 80 85 90 95 

189 Leu Asn Gln Pro Ser Thr Gln Ile Val Ala Asn Ala Lys Gly Ala Val 

190 100 105 110 

191 Thr Gly Ala Lys Asp Ala Val Thr Thr Thr Val Thr Gly Ala Lys Asp 

192 115 120 125 

193 Ser Val Ala Ser Thr Ile Thr Gly Val Met Asp Lys Thr Lys Gly Ala 

194 130 135 140 

195 Val Thr Gly Ser Val Glu Lys Thr Lys Ser Val Val Ser Gly Ser Ile 

196 145 150 155 160 

197 Asn Thr Val Leu Gly Ser Arg Met Met Gln Leu Val Ser Ser Gly Val 

198 165 170 175 

199 Glu Asn Ala Leu Thr Lys Ser Glu Leu Leu Val Glu Gln Tyr Leu Pro 

200 180 185 190 

201 Leu Thr Glu Glu Glu Leu Glu Lys Glu Ala Lys Lys Val Glu Gly Phe 

202 195 200 205 

203 Asp Leu Val Gln Lys Pro Ser Tyr Tyr Val Arg Leu Gly Ser Leu Ser 

204 210 215 220 

205 Thr Lys Leu His Ser Arg Ala Tyr Gln Gln Ala Leu Ser Arg Val Lys 

206 225 230 235 240 

207 Glu Ala Lys Gln Lys Ser Gln Gln Thr Ile Ser Gln Leu His Ser Thr 

208 245 250 255 

209 Val His Leu Ile Glu Phe Ala Arg Lys Asn Val Tyr Ser Ala Asn Gln 

210 260 265 270 

211 Lys Ile Gln Asp Ala Gln Asp Lys Leu Tyr Leu Ser Trp Val Glu Trp 

212 275 280 285 

213 Lys Arg Ser Ile Gly Tyr Asp Asp Thr Asp Glu Ser His Cys Ala Glu 

214 290 295 300 

215 His Ile Glu Ser Arg Thr Leu Ala Ile Ala Arg Asn Leu Thr Gln Gln 

216 305 310 315 320 

217 Leu Gln Thr Thr Cys His Thr Leu Leu Ser Asn Ile Gln Gly Val Pro 

218 325 330 335 

219 Gln Asn Ile Gln Asp Gln Ala Lys His Met Gly Val Met Ala Gly Asp 

220 340 345 350 

221 Ile Tyr Ser Val Phe Arg Asn Ala Ala Ser Phe Lys Glu Val Ser Asp 

222 355 360 365 

223 Ser Leu Leu Thr Ser Ser Lys Gly Gln Leu Gln Lys Met Lys Glu Ser 

224 370 375 380 

225 Leu Asp Asp Val Met Asp Tyr Leu Val Asn Asn Thr Pro Leu Asn Trp 

226 385 390 395 400 

227 Leu Val Gly Pro Phe Tyr Pro Gln Leu Thr Glu Ser Gln Asn Ala Gln 

228 405 410 415 

229 Asp Gln Gly Ala Glu Met Asp Lys Ser Ser Gln Glu Thr Gln Arg Ser 

230 420 425 430 

231 Glu His Lys Thr His 

232 435 

235 <210> SEQ ID NO: 6 

236 <211> LENGTH: 31 

237 <212> TYPE: PRT 

238 <213> ORGANISM: Artificial Sequence 
240 <220> FEATURE: 

l\l ^ clrl\lT,™ : branChad <=°°" i " 1 "» S->7 Of „cv 

244 <221> NAME/KEY: VARIANT 

245 <222> LOCATION: (1)...(31) 

246 <223> OTHER INFORMATION: Xaa = Ala or Pro at position 1, and Ile or Asn at 

247 position 12 
249 <400> SEQUENCE: 6 

W--> 250 Xaa Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Xaa Arg Arg Pro Gln 

251 1 5 10 15 

252 Asp Val Lys Phe Pro Gly Gly Lys Lys Lys Lys Lys Lys Lys Lys 

253 20 25 30 

256 <210> SEQ ID NO: 7 

257 <211> LENGTH: 11 

258 <212> TYPE: DNA 

259 <213> ORGANISM: Artificial Sequence 
261 <220> FEATURE: 

in <223> ZTJz^ir el iiT,~ iias used *° ™ - 

265 <400> SEQUENCE: 7 

266 gctgagatct a 11 

268 <210> SEQ ID NO: 8 

269 <211> LENGTH: 29 

270 <212> TYPE: DNA 

271 <213> ORGANISM: Artificial Sequence 
273 <220> FEATURE: 

"t <223> zs™ s™ N P1 ^s ttcieotides — <• ™ - ~» 

277 <400> SEQUENCE: 8 

278 gtaaccttcc tggttgctct tgagatcta 29 

280 <210> SEQ ID NO: 9 

281 <211> LENGTH: 17 

282 <212> TYPE: DNA 

283 <213> ORGANISM: Artificial Sequence 

289 <400> SEQUENCE: 9 

290 gtaacctttg agatcta 17 

292 <210> SEQ ID NO: 10 

293 <211> LENGTH: 18 

294 <212> TYPE: DNA 

295 <213> ORGANISM: Artificial Sequence 

301 <400> SEQUENCE: 10 
VERIFICATION SUMMARY 
PATENT APPLICATION: US/09/973,322 

Input Set : A:\SEQLISTDYOU17001CP1 TXT 
Output Set: N:\CRF3\10302001\I973322.raw 



DATE: 10/30/2001 
TIME: 15:33:50 



L.UO M,341 W, ,46) or °>t... n,^ ,!l fS^ff. CUrrent Flli °9 



L:140 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 
L:164 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 
L:183 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 
L:250 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:6 